KMID : 0917520040110040029
Journal of Speech Sciences, 2004, Vol. 11, No. 4, pp. 29-42
Implementation of Sound Source Localization Based on Audio-visual Information for Humanoid Robots
Park Jeong-Ok, Na Seung-You, Kim Jin-Young
Abstract
This paper presents an implementation of real-time speaker localization using audio-visual information. Four channels of microphone signals are processed to detect the vertical as well as the horizontal position of the speaker. First, short-time average magnitude difference function (AMDF) signals are used to determine whether the microphone signals contain a human voice. The orientation and distance of the sound source are then obtained from the interaural time difference. Finally, visual information from a camera fine-tunes the estimated angles to the speaker. Experimental results for the real-time localization system show that performance improves to 99.6%, compared with a rate of 88.8% when only the audio information is used.
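The audio steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the single microphone pair, the AMDF-style delay search, the far-field relation sin(theta) = c * tau / d, and all numeric parameters are assumptions made here for illustration only.

```python
import math


def amdf(frame, max_lag):
    """Short-time AMDF: mean of |x[i] - x[i + lag]| for lag = 0..max_lag.

    Voiced speech is roughly periodic, so its AMDF shows a deep dip at the
    pitch period; noise usually does not. (Illustrative, assumed usage.)
    """
    n = len(frame)
    values = []
    for lag in range(max_lag + 1):
        count = n - lag
        total = sum(abs(frame[i] - frame[i + lag]) for i in range(count))
        values.append(total / count)
    return values


def estimate_delay(left, right, max_lag):
    """Return the lag in samples that minimizes mean |left[i] - right[i + lag]|.

    A positive result means the right channel lags the left one.
    Both channels are assumed to have the same length, greater than max_lag.
    """
    n = len(left)
    best_lag, best_cost = 0, float("inf")
    for lag in range(-max_lag, max_lag + 1):
        lo = max(0, -lag)
        hi = min(n, n - lag)
        cost = sum(abs(left[i] - right[i + lag]) for i in range(lo, hi)) / (hi - lo)
        if cost < best_cost:
            best_lag, best_cost = lag, cost
    return best_lag


def itd_to_angle(delay_samples, sample_rate, mic_distance, speed_of_sound=343.0):
    """Convert a time difference to an angle in degrees (assumed far-field model)."""
    s = speed_of_sound * (delay_samples / sample_rate) / mic_distance
    s = max(-1.0, min(1.0, s))  # clamp so asin stays defined under noisy delays
    return math.degrees(math.asin(s))
```

With one microphone pair this yields a single angle; a four-channel array such as the one described above could apply the same idea to a horizontal pair and a vertical pair, and a camera could then refine the result.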